Beyond Geometric Path Planning: Learning Context-Driven Trajectory Preferences via Sub-optimal Feedback

Authors

  • Ashesh Jain
  • Shikhar Sharma
  • Ashutosh Saxena
Abstract

We consider the problem of learning preferences over trajectories for mobile manipulators such as personal robots and assembly line robots. The preferences we learn are more intricate than those arising from simple geometric constraints on the robot’s trajectory, such as the distance of the robot from a human. Instead, our preferences are governed by the surrounding context of various objects and human interactions in the environment. Such preferences make the problem challenging because the criterion for a good trajectory now varies with the task, the environment, and the user. Furthermore, demonstrating optimal trajectories (e.g., learning from an expert’s demonstrations) is often challenging and non-intuitive on high degree-of-freedom manipulators. In this work, we propose an approach that requires a non-expert user only to incrementally improve the trajectory currently proposed by the robot. We implement our algorithm on two high degree-of-freedom robots, PR2 and Baxter, and present three intuitive mechanisms for providing such incremental feedback. In our experimental evaluation we consider two context-rich settings, household chores and grocery store checkout, and show that users are able to train the robot with just a few rounds of feedback (taking only a few minutes). Despite receiving sub-optimal feedback from non-expert users, our algorithm enjoys theoretical bounds on regret that match the asymptotic rates of optimal trajectory algorithms.
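The abstract does not spell out the update rule, so the following is only a minimal sketch of the general idea of learning from incremental, sub-optimal improvements. It assumes a linear score over trajectory features and a generic preference-perceptron-style update. The function names, the toy feature vectors, and the simulated user are all hypothetical and are not taken from the paper.

```python
import numpy as np

def propose(w, candidates):
    """Return the index of the candidate with the highest linear score w . phi."""
    return int(np.argmax(candidates @ w))

def coactive_update(w, phi_proposed, phi_improved):
    """Perceptron-style step: shift w toward the features of the improved trajectory."""
    return w + (phi_improved - phi_proposed)

def simulate(w_true, candidates, rounds=6):
    """Run a toy feedback loop against a simulated non-expert user.

    Assumption: the user never returns the best trajectory directly, only
    the candidate that is the smallest strict improvement over the proposal.
    """
    w = np.zeros_like(w_true)
    true_scores = candidates @ w_true
    regrets = []
    for _ in range(rounds):
        i = propose(w, candidates)
        # Regret of this round: gap between the best score and the proposal's score.
        regrets.append(float(true_scores.max() - true_scores[i]))
        better = np.flatnonzero(true_scores > true_scores[i])
        if better.size == 0:
            continue  # proposal is already optimal, so no feedback is given
        j = better[np.argmin(true_scores[better])]
        w = coactive_update(w, candidates[i], candidates[j])
    return w, regrets

# Toy data: three trajectories described by two made-up features.
candidates = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
w_true = np.array([1.0, 2.0])  # hidden user preference (hypothetical)
w_learned, regrets = simulate(w_true, candidates)
```

In this toy run the per-round regret falls to zero after a few rounds even though the simulated user never demonstrates the best trajectory, which is the behaviour the regret bounds in the abstract are concerned with.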

Similar resources

Learning preferences for manipulation tasks from online coactive feedback

We consider the problem of learning preferences over trajectories for mobile manipulators such as personal robots and assembly line robots. The preferences we learn are more intricate than simple geometric constraints on trajectories; they are rather governed by the surrounding context of various objects and human interactions in the environment. We propose a coactive online learning framework ...

A hybrid feedback controller for car-like robots - combining reactive obstacle avoidance and global replanning

This paper presents a hybrid feedback controller for path control of autonomous mobile robots. The controller combines reactive obstacle avoidance with global path replanning, enabling collision-free navigation along a preplanned path. Avoidance of local obstacles is accomplished by adjusting the vehicle’s lateral deviation from the path trajectory reactively. Global path replanning is performe...

Fast Optimization Based Motion Planning and Path-Tracking Control for Car Parking

This paper presents a car parking control concept for real-time application. It utilizes a two-degrees-of-freedom control scheme consisting of a feedforward and a feedback controller. The reference trajectory is constructed in two steps. First, a geometric path is planned by solving a local static optimization problem, which is formulated by discretizing the path. Second, a path-following problem ...

Learning Trajectory Preferences for Manipulators via Iterative Improvement

We consider the problem of learning good trajectories for manipulation tasks. This is challenging because the criterion defining a good trajectory varies with users, tasks and environments. In this paper, we propose a co-active online learning framework for teaching robots the preferences of their users for object manipulation tasks. The key novelty of our approach lies in the type of feedback ex...

Multiresolution Aircraft Guidance in a Spatiotemporally-varying Threat Field

We consider the problem of generating an optimal aircraft reference trajectory in a horizontal plane, where the objective is to minimize exposure to a spatiotemporally varying scalar field. We consider a special case, where this field is time-invariant, and we discuss optimal trajectory generation for an aircraft kinematic model based on first- and second-order necessary conditions of optimal cont...

Publication date: 2013